Combining Clustering Approaches for Semi-Supervised Parsing: the BASQUE TEAM system in the SPRML’2014 Shared Task
نویسندگان
چکیده
This paper presents a dependency parsing system, presented as BASQUE TEAM at the SPMRL’2014 Shared Task, based on the combination of different clustering approaches. We create new features applying clustering methods to automatically annotated large corpora. Once these new features are calculated, we add them to the base features in order to create a series of analyzers using two freely available and state of the art dependency parsers, MaltParser and Mate. Finally, we will combine previously achieved parses using a voting approach.
منابع مشابه
Exploiting the Contribution of Morphological Information to Parsing: the BASQUE TEAM system in the SPRML'2013 Shared Task
This paper presents a dependency parsing system, presented as BASQUE TEAM at the SPMRL’2013 Shared Task, based on the analysis of each morphological feature of the languages. Once the specific relevance of each morphological feature is calculated, this system uses the most significant of them to create a series of analyzers using two freely available and state of the art dependency parsers, Mal...
متن کاملAn Unsupervised Text Mining Method for Relation Extraction from Biomedical Literature
The wealth of interaction information provided in biomedical articles motivated the implementation of text mining approaches to automatically extract biomedical relations. This paper presents an unsupervised method based on pattern clustering and sentence parsing to deal with biomedical relation extraction. Pattern clustering algorithm is based on Polynomial Kernel method, which identifies inte...
متن کاملComposite Kernel Optimization in Semi-Supervised Metric
Machine-learning solutions to classification, clustering and matching problems critically depend on the adopted metric, which in the past was selected heuristically. In the last decade, it has been demonstrated that an appropriate metric can be learnt from data, resulting in superior performance as compared with traditional metrics. This has recently stimulated a considerable interest in the to...
متن کاملDependency Parsing: Past, Present, and Future
Dependency parsing has gained more and more interest in natural language processing in recent years due to its simplicity and general applicability for diverse languages. The international conference of computational natural language learning (CoNLL) has organized shared tasks on multilingual dependency parsing successively from 2006 to 2009, which leads to extensive progress on dependency pars...
متن کاملRobust Multilingual Named Entity Recognition with Shallow Semi-Supervised Features
We present a multilingual Named Entity Recognition approach based on a robust and general set of features across languages and datasets. Our system combines shallow local information with clustering semi-supervised features induced on large amounts of unlabeled text. Understanding via empirical experimentation how to effectively combine various types of clustering features allows us to seamless...
متن کامل